The results of Komlós, Major and Tusnády give optimal Wiener
approximation of partial sums of i.i.d. random variables and provide
an extremely powerful tool in probability and statistical inference.
Recently Wu [Ann. Probab. 35 (2007) 2294-2320] obtained Wiener
approximation of a class of dependent stationary processes with finite
pth moments, $2 < p \le 4$, with error term $o(n^{1/p}(\log n)^{\gamma})$, $\gamma > 0$,
and Liu and Lin [Stochastic Process. Appl. 119 (2009) 249-280] removed
the logarithmic factor, reaching the Komlós-Major-Tusnády
bound $o(n^{1/p})$. No similar results exist for $p > 4$, and in fact, no
existing method for dependent approximation yields an a.s. rate better
than $o(n^{1/4})$. In this paper we show that, by allowing a second Wiener
component in the approximation, we can get rates near to $o(n^{1/p})$
for arbitrary $p > 2$. This substantially extends the scope of applications
of the results, as we illustrate by proving new limit theorems for
increments of stochastic processes and statistical tests for short term
(epidemic) changes in stationary processes. Our method works under
a general weak dependence condition covering wide classes of linear
and nonlinear time series models and classical dynamical systems.

1. Introduction. Let $X, X_1, X_2, \ldots$ be i.i.d. random variables with mean 0
and variance 1, and let $S_n = \sum_{k \le n} X_k$. Komlós, Major and Tusnády [25, 26]
showed that if $E(e^{t|X|}) < \infty$ for some $t > 0$ then, after suitably enlarging the
probability space, there exists a Wiener process $\{W(t), t \ge 0\}$ such that

(1)  $S_n = W(n) + O(\log n)$ a.s.

Also, if $E|X|^p < \infty$ for some $p > 2$, they proved the approximation

(2)  $S_n = W(n) + o(n^{1/p})$ a.s.

The remainder terms in (1) and (2) are optimal. In the case when only
$EX = 0$, $EX^2 = 1$ is assumed, Strassen [46] obtained

(3)  $S_n = W(n) + o((n \log\log n)^{1/2})$ a.s.

Without additional moment assumptions the rate in (3) is also optimal (see
Major [29]). Relation (3) is a useful invariance principle for the law of the
iterated logarithm; on the other hand, it does not imply the CLT for $\{X_n\}$.
This difficulty was removed by Major [30] who showed that under $EX = 0$,
$EX^2 = 1$ there exists a Wiener process $W$ and a numerical sequence $\tau_n \sim n$
such that

(4)  $S_n = W(\tau_n) + o(n^{1/2})$ a.s.

Thus allowing a slight perturbation of the approximating Wiener process
one can reach the remainder term $o(n^{1/p})$ also for $p = 2$, making the result
applicable for a wide class of CLT-type results. The case of strong
approximation under the moment condition $EX^2 h(|X|) < \infty$ where $h(x) = o(x^{\varepsilon})$,
$x \to \infty$, for any $\varepsilon > 0$, has been cleared up completely by Einmahl [18].

The previous results, which settle the strong approximation problem for 
i.i.d. random variables with finite variances, provide powerful tools in prob- 
ability and statistical inference (see, e.g., Shorack and Wellner [45]). Start- 
ing with Strassen [47], a wide literature has dealt with extensions of the 
above results for weakly dependent sequences, but the existing results are 
much weaker than in the i.i.d. case. Recently, however, Wu [50] showed that 
for a large class of weakly dependent stationary sequences $\{X_n\}$ satisfying
$E|X_1|^p < \infty$, $2 < p \le 4$, we have the approximation

$S_n = W(n) + o(n^{1/p}(\log n)^{\gamma})$ a.s.

for some $\gamma > 0$, and Liu and Lin [28] removed the logarithmic factor in
the error term, reaching the optimal Komlós-Major-Tusnády bound. The
proofs do not work for $p > 4$, and in fact, no existing method for dependent
approximation yields an a.s. rate better than $o(n^{1/4})$. On the other hand,
many important limit theorems in probability and statistics involve norming
sequences smaller than $n^{1/4}$, making such results inaccessible by invariance
methods. The purpose of the present paper is to fill this gap and provide
a new type of approximation theorem reaching nearly the
Komlós-Major-Tusnády rate for any $p > 2$.

As noted above, reaching the error term $o(n^{1/2})$ for i.i.d. sequences with
finite variance requires a perturbation of the approximating Wiener
process $W$. In the case of dependent processes we will also need a similar
perturbation, and, more essentially, we will include a second Wiener process in
the approximation, whose scaling factor is smaller than that of $W$, and thus
it will not affect the asymptotic behavior of the main term. Specifically, for
a large class of weakly dependent stationary processes $\{Y_k\}$ with finite pth
moments, $2 < p < \infty$, we will prove the approximation

(5)  $\sum_{k=1}^{n} Y_k = W_1(s_n^2) + W_2(t_n^2) + O(n^{(1+\eta)/p})$ a.s.,

where $\{W_1(t), t \ge 0\}$ and $\{W_2(t), t \ge 0\}$ are standard Wiener processes,
and $s_n$, $t_n$ are numerical sequences with

$s_n^2 \sim \sigma^2 n, \qquad t_n^2 \sim c n^{\gamma}$

for some $0 < \gamma < 1$, $\sigma^2 > 0$, $c > 0$. The new element in (5) is the term $W_2(t_n^2)$
which, by its smaller scaling, does not disturb the asymptotic properties
of $W_1(s_n^2)$. Note that the processes $W_1$, $W_2$ are not independent, but this
will not present any difficulties in applications. (See also Proposition 1 in the
next section.) The number $\eta$ depends on the weak dependence rate of $\{Y_k\}$
(introduced below), and can be made arbitrarily small under suitable rate
conditions.

For $p \ge 1$ and a random variable $Y$, let $\|Y\|_p = (E|Y|^p)^{1/p}$. If $A$ and $B$
are subsets of $Z$, we let $d(A, B) = \inf\{|a - b| : a \in A, b \in B\}$.

Definition 1. Let $\{Y_k, k \in Z\}$ be a stochastic process, let $p \ge 1$ and let
$\delta(m) \to 0$. We say that $\{Y_k, k \in Z\}$ is weakly $\mathcal{M}$-dependent in $L^p$ with rate
function $\delta(\cdot)$ if:

(A) For any $k \in Z$, $m \in N$ one can find a random variable $Y_k^{(m)}$ with finite
pth moment such that

$\|Y_k - Y_k^{(m)}\|_p \le \delta(m).$

(B) For any disjoint intervals $I_1, \ldots, I_r$ ($r \in N$) of integers and any
positive integers $m_1, \ldots, m_r$, the vectors $\{Y_j^{(m_1)}, j \in I_1\}, \ldots, \{Y_j^{(m_r)}, j \in I_r\}$ are
independent provided $d(I_k, I_l) > \max\{m_k, m_l\}$ for $1 \le k < l \le r$.
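
The separation requirement in condition (B) is purely combinatorial, so it can be checked mechanically. The following is a minimal sketch, assuming each interval is given as a pair of integer endpoints; the helper names `interval_distance` and `separated` are illustrative and do not come from the paper.

```python
from itertools import combinations

def interval_distance(a, b):
    """d(A, B) = inf{|x - y| : x in A, y in B} for integer intervals (lo, hi)."""
    (a_lo, a_hi), (b_lo, b_hi) = a, b
    if a_hi < b_lo:
        return b_lo - a_hi
    if b_hi < a_lo:
        return a_lo - b_hi
    return 0  # the intervals overlap

def separated(intervals, lags):
    """True if d(I_k, I_l) > max(m_k, m_l) for every pair k < l, as in (B)."""
    pairs = combinations(zip(intervals, lags), 2)
    return all(interval_distance(i, j) > max(m, n) for (i, m), (j, n) in pairs)
```

For instance, the intervals {1,...,5} and {9,...,12} with lags 3 and 2 are at distance 4 and hence separated in the sense of (B), while {1,...,5} and {8,...,12} are not.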

We remark that our dependence condition is naturally preserved under
smooth transformations. For example, if $\{Y_k\}$ is weakly $\mathcal{M}$-dependent in
$L^p$ with rate $\delta(\cdot)$, and $h$ is a Lipschitz $\alpha$ function ($0 < \alpha \le 1$) with Lipschitz
constant $K$, then by the monotonicity of $\|Y_k - Y_k^{(m)}\|_p$ in $p$ we have

$\|h(Y_k) - h(Y_k^{(m)})\|_p \le K \|Y_k - Y_k^{(m)}\|_{\alpha p}^{\alpha} \le K \|Y_k - Y_k^{(m)}\|_p^{\alpha},$

and thus $\{h(Y_k)\}$ is also weakly $\mathcal{M}$-dependent in $L^p$ with rate function $K\delta(\cdot)^{\alpha}$.
Note that (B) implies that for any fixed $m$ the sequence $\{Y_k^{(m)}, k \in Z\}$ is an
m-dependent process. Hence, sequences satisfying conditions (A) and (B) are
approximable, in the $L^p$ sense, by m-dependent processes of any fixed order
$m \ge 1$ with termwise approximation error $\delta(m)$. In other words, sequences in
Definition 1 are close to m-dependent sequences, the value of $m$ depending on
the required closeness, explaining the terminology. Since
$\|Y_k\|_p \le \|Y_k^{(m)}\|_p + \|Y_k - Y_k^{(m)}\|_p$, condition (A) implies that $E|Y_k|^p$ is finite.
Using $L^p$-distance is convenient for our theorems, but, depending on the
application, other distances can be used in part (A) of Definition 1. For
example, defining (as usual) the $L_0$ norm of a random variable $X$ by

$\|X\|_0 = \inf\{\varepsilon > 0 : P(|X| > \varepsilon) < \varepsilon\},$

condition (A) could be replaced by

$\|Y_k - Y_k^{(m)}\|_0 \le \delta(m).$

Such a definition requires no moment assumptions and turns out to provide
a useful dependence measure for studying empirical processes (see [3]).

Trivially the previous definition covers m-dependent processes for any
fixed $m$ (see also Section 3.1), but, in contrast to the very restrictive
condition of m-dependence, weak $\mathcal{M}$-dependence holds for a huge class of
stationary sequences, including those studied in Wu [49, 50] and Liu and Lin [28].
In the case when $\{Y_k, k \in Z\}$ allows a Wiener-Rosenblatt representation

(6)  $Y_k = f(\varepsilon_k, \varepsilon_{k-1}, \ldots), \qquad k \in Z,$

with an i.i.d. sequence $\{\varepsilon_k, k \in Z\}$, weak $\mathcal{M}$-dependence is very close to
Wu's physical dependence condition in [49], except that we allow a larger
freedom in choosing the approximating random variables $Y_k^{(m)}$, compared
with the choice in [49, 50] via coupling. (For sufficient criteria for the
representation (6), see Rosenblatt [40-42].) Note that instead of (6) we may also
assume a two-sided representation

(7)  $Y_k = f(\ldots, \varepsilon_{k-1}, \varepsilon_k, \varepsilon_{k+1}, \ldots), \qquad k \in Z,$

of $\{Y_k\}$. In the case when $\{Y_k, k \in Z\}$ allows the representation (7) with
mixing $\{\varepsilon_k\}$, Definition 1 is a modified version of NED (see Section 3.2), a weak
dependence condition which appeared already in Ibragimov [22] and has
been brought forward in Billingsley [5] (see also [31, 32]). Later NED has
been successfully used in the econometrics literature to establish weak
dependence of dynamic time series models (see, e.g., [35]). In Section 3 we will
discuss further the connection of weak $\mathcal{M}$-dependence with known
weak dependence conditions. We stress that the definition of weak
$\mathcal{M}$-dependence does not assume the representation (6) or (7), although it was
motivated by this case. The reason for using our more general definition
is to illuminate the essential structural condition on $\{Y_k\}$ required for our
theorems. Extensions of our results for "classical" mixing conditions, like $\alpha$,
$\beta$, $\rho$ mixing and their variants will be given in a subsequent paper.

The main results of our paper are formulated in Section 2. In Section 3 
we give several examples. Applications of the theorems can be found in 
Sections 4 and 5, while Section 6 contains the proofs of the main theorems. 

2. Main theorems. We write $a_n \ll b_n$ if $\limsup_{n \to \infty} |a_n/b_n| < \infty$.

Theorem 1. Let $p > 2$, $\eta > 0$ and let $\{Y_k, k \in Z\}$ be a centered
stationary sequence, weakly $\mathcal{M}$-dependent in $L^p$ with rate function

(8)  $\delta(m) \ll m^{-A},$

where

(9)  $A > \frac{p-2}{2\eta}\Bigl(1 - \frac{1+\eta}{p}\Bigr) \vee 1, \qquad \frac{1+\eta}{p} < \frac{1}{2}.$

Then the series

(10)  $\sigma^2 = \sum_{k \in Z} E Y_0 Y_k$

is absolutely convergent, and $\{Y_k, k \in Z\}$ can be redefined on a new
probability space together with two Wiener processes $\{W_1(t), t \ge 0\}$ and
$\{W_2(t), t \ge 0\}$ such that

(11)  $\sum_{k=1}^{n} Y_k = W_1(s_n^2) + W_2(t_n^2) + O(n^{(1+\eta)/p})$ a.s.,

where $\{s_n\}$ and $\{t_n\}$ are nondecreasing numerical sequences with

(12)  $s_n^2 \sim \sigma^2 n, \qquad t_n^2 \sim c n^{\gamma}$

for some $0 < \gamma < 1$, $c > 0$.

Note that for any fixed $p > 2$ and $0 < \eta < (p - 2)/2$, condition (9) is
satisfied if $A$ is large enough, and thus Theorem 1 provides an a.s. invariance
principle with remainder term close to the optimal remainder term $o(n^{1/p})$
in the Komlós-Major-Tusnády approximation.

It is natural to ask if $W_1(s_n^2)$ in (11) can be replaced by $W_1(\sigma^2 n)$, a fact
that would simplify applications. The proof of the theorem yields an $s_n$ with
$s_n^2 = \sigma^2 n + O(n^{1-\varepsilon})$ for some $0 < \varepsilon < 1$, but for $A$ barely exceeding the lower
bound in (9), the explicit value of $\varepsilon$ is very small. Thus replacing $W_1(s_n^2)$
by $W_1(\sigma^2 n)$ introduces an additional error term that ruins the error term
$O(n^{(1+\eta)/p})$ in (11). The situation is similar to the Wiener approximation
of partial sums of i.i.d. random variables with mean 0 and variance 1 when
we have (4) with a numerical sequence $\tau_n \sim n$, but in general (4) does not
hold with $\tau_n = n$. (See Major [29, 30].) Note, however, that in our case the
large difference between $s_n^2$ and $\sigma^2 n$ is a consequence of the method, and we
do not claim that another construction cannot yield the approximation (11)
with $s_n^2 = \sigma^2 n$. However, the presence of $s_n^2$ in (11) does not limit the
applicability of our strong invariance principle: $s_n^2$ and $t_n^2$ are explicitly calculable
nonrandom numbers and as we will see, applying limit theorems for $W_1(s_n^2)$
is as easy as for $W_1(\sigma^2 n)$.

As the proof of Theorem 1 will show, the sequences $\{s_n\}$ and $\{t_n\}$ in (11)
have a complementary character. More precisely, there is a partition
$N = G_1 \cup G_2$ (provided by the long and short blocks in a traditional blocking
argument) and a representation

$s_n^2 = \sum_{k=1}^{n} \sigma_k^2, \qquad t_n^2 = \sum_{k=1}^{n} \tau_k^2 \qquad (n = 1, 2, \ldots)$

such that $\sigma_k^2$ converges to $\sigma^2$ on $G_1$ and equals 0 on $G_2$, and $\tau_k^2$ converges
to $\sigma^2$ on $G_2$ and equals 0 on $G_1$. In particular,

(13)  $\limsup_{n \to \infty} (s_{n+1}^2 - s_n^2) = \limsup_{n \to \infty} (t_{n+1}^2 - t_n^2) = \sigma^2$

and both liminf's are equal to 0.

The numerical value of $\gamma$ in (12) plays no role in the applications in this
paper, but for later applications we note that if 



2r/(l-eo)^V P 
for some $0 < \varepsilon_0 < 1$, then we can choose

(14) 7 = l-eo-'^^'-^° 



p-2(l + r/eo)' 

As we already mentioned in the Introduction, the processes Wi and W2 
are not independent. While for our applications this is not important, the 
following proposition might be useful for possible further applications. 

Proposition 1. Under the assumptions of Theorem 1 we have 

(15)  $\operatorname{Corr}(W_1(s_n), W_2(t_m)) \to 0$ as $m, n \to \infty$.

Our next theorem is the analogue of Theorem 1 in the case of an expo- 
nential decay in the dependence condition. 

Theorem 2. Let $p > 2$ and let $\{Y_k, k \in Z\}$ be a centered stationary
sequence, weakly $\mathcal{M}$-dependent in $L^p$ with rate function

(16)  $\delta(m) \ll \exp(-\varrho m), \qquad \varrho > 0.$

Then the series (10) is absolutely convergent, and $\{Y_k, k \in Z\}$ can be
redefined on a new probability space together with two standard Wiener processes
$\{W_1(t), t \ge 0\}$ and $\{W_2(t), t \ge 0\}$ such that

(17)  $\sum_{k=1}^{n} Y_k = W_1(s_n^2) + W_2(t_n^2) + O(n^{1/p} \log^2 n)$ a.s.,

where $\{s_n\}$ and $\{t_n\}$ are nondecreasing numerical sequences such that
$s_n^2 \sim \sigma^2 n$, $t_n^2 \sim \sigma^2 n / \log n$ and (13) holds.

Like in Theorem 1, $s_n^2 \sim \sigma^2 n$ can be sharpened to $s_n^2 = \sigma^2 n + O(n/\log n)$;
see the remarks after Theorem 1.

Using the law of the iterated logarithm for $W_2$, relation (11) implies

(18)  $\sum_{k=1}^{n} Y_k = W_1(s_n^2) + O(n^{1/2-\lambda})$

for some $\lambda > 0$, which is the standard form of strong invariance principles.
However, since $\gamma$ in (12) is typically near to 1, the $\lambda$ in (18) can be very
small, and thus the effect of the very strong error term $O(n^{(1+\eta)/p})$ in (11)
is lost.

The proof of the strong approximation theorems in Wu [50] depends on
martingale approximation, while Liu and Lin [28] use approximation of the
partial sums of $\{Y_k\}$ by partial sums of m-dependent r.v.'s. Our approach
differs from both, using a direct approximation of separated block sums
of $\{Y_k\}$ by independent r.v.'s, an idea used earlier in [2-4, 21]. In this
approach, the second Wiener process $W_2$ is provided by the sum of short block
sums. The question whether one can get a remainder term near $o(n^{1/p})$ in the
simple (one-term) Wiener approximation for any $p > 2$ remains open.

3. Examples of weakly $\mathcal{M}$-dependent processes. The classical approach
to weak dependence, developed in the seminal papers of Rosenblatt [39]
and Ibragimov [22], uses the strong mixing property and its variants like $\beta$,
$\varrho$, $\phi$ and $\psi$ mixing, combined with a blocking technique to connect the
partial sum behavior of $\{Y_k\}$ with that of independent random variables.
This method yields very sharp results (for a complete account of the classical
theory see Bradley [7]), but verifying mixing conditions of the above type
is not easy and even when they apply (e.g., for Markov processes), they
typically require strong smoothness conditions on the process. For example,
for the AR(1) process

$Y_k = \frac{1}{2} Y_{k-1} + \varepsilon_k$

with Bernoulli innovations, strong mixing fails to hold (cf. Andrews [1]).
Recognizing this fact, an important line of research in probability theory in
past years has been to find weak dependence conditions which are strong
enough to imply satisfactory asymptotic results, but which are sufficiently
general to be satisfied in typical applications. Several conditions of this kind
have been found, in particular by the French school (see [10-12, 16, 37, 38]).
A different type of mixing conditions, the so-called physical and predictive
dependence measures, have been introduced by Wu [49] for stationary
processes $\{Y_k\}$ admitting the representation (6) where $\{\varepsilon_k, k \in Z\}$ is an i.i.d.
sequence, and $f : \mathbb{R}^{\mathbb{N}} \to \mathbb{R}$ is a Borel-measurable function. These conditions
are particularly easy to handle, since they are defined in terms of the
algorithms which generate the process $\{Y_k\}$. Weak $\mathcal{M}$-dependence, although
formally not requiring a representation of the form (6), is closely related to
Wu's mixing conditions and works best for processes $\{Y_k\}$ having a
representation (6) or its two-sided version (7). The examples below will clear up
the exact connection of our weak $\mathcal{M}$-dependence condition with the mixing
conditions in Wu [49, 50] and Liu and Lin [28].

3.1. m-dependent processes. Definition 1 implies that $\{Y_k, k \in Z\}$ can be
approximated, for every $m \ge 1$, by an m-dependent process with termwise
error $\delta(m)$. If $\{Y_k, k \in Z\}$ itself is m-dependent for some fixed $m = m_0$ and
$K := \sup_{k \in Z} \|Y_k\|_p < \infty$, then Definition 1 is satisfied with

$\delta(j) = K$ if $j < m_0$, $\qquad \delta(j) = 0$ if $j \ge m_0$,

and $Y_k^{(n)} = 0$ if $n < m_0$ and $Y_k^{(n)} = Y_k$ if $n \ge m_0$. In other words, m-dependent
sequences with uniformly bounded $L^p$ norms are weakly $\mathcal{M}$-dependent with
the above parameters. It is worth mentioning that m-dependent processes
in general do not have the representation (7) (see, e.g., [8, 13]).

3.2. NED processes. Under (7) our condition can be directly compared
to NED. We recall:

Definition 2 (NED). A sequence $\{Y_k, k \in Z\}$ having representation (7)
is called NED over $\{\varepsilon_k\}$ under $L^p$-norm with rate function $\delta(\cdot)$ if for any
$k \in Z$, $m \ge 1$,

$\|Y_k - E[Y_k \mid \mathcal{F}_{k-m}^{k+m}]\|_p \le \delta(m),$

where $\mathcal{F}_{k-m}^{k+m}$ is the $\sigma$-algebra generated by $\varepsilon_{k-m}, \ldots, \varepsilon_{k+m}$.

Clearly, if $\{\varepsilon_k\}$ is an independent sequence, then $Y_k^{(m)} = E[Y_k \mid \mathcal{F}_{k-m}^{k+m}]$
satisfies (B) of Definition 1. Hence if $\{Y_k\}$ is NED over $\{\varepsilon_k\}$ in $L^p$-norm with
rate function $\delta(\cdot)$ where $\{\varepsilon_k\}$ is an independent sequence, then $\{Y_k\}$ is weakly
$\mathcal{M}$-dependent with the same $p$, $\delta(\cdot)$.

As our examples below will show, for weakly $\mathcal{M}$-dependent sequences the
construction for $Y_k^{(m)}$ is not restricted to $E[Y_k \mid \mathcal{F}_{k-m}^{k+m}]$, but is often more
conveniently established by truncation or coupling methods.

3.3. Linear processes. Let $Y_k = \sum_{j=-\infty}^{\infty} a_j \varepsilon_{k-j}$ with the i.i.d. innovations
$\{\varepsilon_j, j \in Z\}$. If $a_j = 0$ for $j < 0$, then the sequence $\{Y_k, k \in Z\}$ is causal. Liu
and Lin [28] and Wang, Lin and Gulati [48] studied strong approximations
of the partial sums with Gaussian processes (in the short- and long-memory
cases).

We define $Y_k^{(m)}$ as

$Y_k^{(m)} = \sum_{j=-[m/2]}^{[m/2]} a_j \varepsilon_{k-j}.$

This directly ensures that condition (B) holds. To verify condition (8) we will
assume that $E|\varepsilon_0|^p < \infty$ for some $p > 2$ as well as $|a_j| \ll |j|^{-(A+1)}$ ($|j| \to \infty$).
Then we get, using the Minkowski inequality,

$\|Y_k - Y_k^{(m)}\|_p = \Bigl\| \sum_{|j| > m/2} a_j \varepsilon_{k-j} \Bigr\|_p \le \|\varepsilon_0\|_p \sum_{|j| > m/2} |a_j| \ll \sum_{|j| > m/2} |j|^{-(A+1)} \ll m^{-A}.$

Thus if $A$ is large enough, Theorem 1 applies. Obviously if $|a_j| \ll \rho^{|j|}$ with
some $0 < \rho < 1$, then (16) holds, and Theorem 2 applies.
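
The Minkowski bound above can be checked numerically. The following is a minimal sketch, assuming the hypothetical coefficients $a_j = |j|^{-(A+1)}$ with $A = 2$ and $\|\varepsilon_0\|_p = 1$; the function name `truncation_error_bound` and the cutoff `j_max` (which replaces the infinite tail by a finite sum) are illustrative choices, not part of the paper.

```python
def truncation_error_bound(coeff, m, eps_norm=1.0, j_max=100_000):
    """Minkowski bound ||eps_0||_p * sum_{|j| > m/2} |a_j|, summed up to |j| <= j_max."""
    start = m // 2 + 1  # first index j with j > m/2
    tail = sum(abs(coeff(j)) + abs(coeff(-j)) for j in range(start, j_max + 1))
    return eps_norm * tail

A = 2.0

def coeff(j):
    # Hypothetical two-sided coefficients a_j = |j|^{-(A+1)}, with a_0 = 0.
    return 0.0 if j == 0 else abs(j) ** (-(A + 1.0))

bounds = {m: truncation_error_bound(coeff, m) for m in (10, 20, 40, 80)}
# If the bound decays like m^{-A}, the rescaled values stay bounded.
ratios = [bounds[m] * m ** A for m in bounds]
```

Under these assumptions the rescaled values `ratios` stay within a fixed band, which is the numerical counterpart of $\delta(m) \ll m^{-A}$ in (8).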

3.4. Nonlinear time series. Let the time series $\{Y_k, k \in Z\}$ be defined by
the stochastic recurrence equation

(19)  $Y_k = G(Y_{k-1}, \varepsilon_k),$

where $G$ is a measurable function, and $\{\varepsilon_k, k \in Z\}$ is an i.i.d. sequence. For
example, ARCH(1) processes (see, e.g., Engle [19]), which play an important
role in the econometrics literature, are included in this setting. Sufficient
conditions for the existence of a stationary solution of (19) can be found in
Diaconis and Freedman [14]. Note that iterating (19) yields $Y_k = f(\ldots, \varepsilon_{k-1}, \varepsilon_k)$
for some measurable function $f$. This suggests defining the approximating
random variables $Y_k^{(m)}$ as $Y_k^{(m)} = f(\ldots, 0, 0, \varepsilon_{k-m}, \ldots, \varepsilon_k)$. Note, however,
that this definition does not guarantee the convergence and thus the
existence of $Y_k^{(m)}$. The coupling used by Wu [49] avoids this problem by defining

$Y_k^{(m)} = f(\ldots, \varepsilon_{k-m-2}^{(k)}, \varepsilon_{k-m-1}^{(k)}, \varepsilon_{k-m}, \ldots, \varepsilon_k),$

where $\{\varepsilon_k^{(l)}, k \in Z\}$, $l = 1, 2, \ldots$, are i.i.d. sequences with the same
distribution as $\{\varepsilon_k, k \in Z\}$ which are independent of each other and of the
$\{\varepsilon_k, k \in Z\}$. These random variables satisfy condition (B). Results from Wu and
Shao [52] show that under some simple technical assumption on $G$,

$\|Y_k - Y_k^{(m)}\|_p \ll \exp(-\varrho m)$

holds with some $p > 0$ and $\varrho > 0$. Thus for $p > 2$, Theorem 2 applies.
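
The geometric decay produced by the coupling is easy to see when $G$ is a contraction in its first argument. The following is a minimal sketch, assuming the hypothetical map $G(y, e) = \frac{1}{2}\sin y + e$ (Lipschitz constant 1/2 in $y$); the two fixed starting values stand in for the original past and its independent copy, and are not taken from the paper.

```python
import math
import random

def G(y, e):
    # Hypothetical contraction: |G(y, e) - G(y', e)| <= 0.5 * |y - y'|.
    return 0.5 * math.sin(y) + e

def iterate(y_start, innovations):
    """Apply the recursion Y_k = G(Y_{k-1}, eps_k) along the given innovations."""
    y = y_start
    for e in innovations:
        y = G(y, e)
    return y

rng = random.Random(0)
shared = [rng.gauss(0, 1) for _ in range(30)]
gaps = []
for m in range(0, 30, 5):
    recent = shared[len(shared) - (m + 1):]   # eps_{k-m}, ..., eps_k
    y = iterate(3.0, recent)                   # value built on one past
    y_coupled = iterate(-3.0, recent)          # value built on a different past
    gaps.append((m, abs(y - y_coupled)))
```

Since the two runs share the last $m+1$ innovations, their distance is at most $6 \cdot 2^{-(m+1)}$ in this sketch, mirroring the exponential rate in (16).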

3.5. Augmented GARCH sequences. Augmented GARCH sequences were
introduced by Duan [17] and turned out to be very useful in applications
in macroeconomics and finance. The model is quite general and many
popular processes are included in its framework. Among others the well-known
GARCH [6], AGARCH [15] and EGARCH [34] models are covered. We
consider the special case of augmented GARCH(1,1) sequences, that is,
sequences $\{Y_k, k \in Z\}$ defined by

(20)  $Y_k = \sigma_k \varepsilon_k,$

where the conditional variance $\sigma_k^2$ is given by

(21)  $\Lambda(\sigma_k^2) = c(\varepsilon_{k-1}) \Lambda(\sigma_{k-1}^2) + g(\varepsilon_{k-1}).$

Here $\{\varepsilon_k, k \in Z\}$ is a sequence of i.i.d. errors, and $\Lambda(x)$, $c(x)$ and $g(x)$ are
real-valued measurable functions. To solve (21) for $\sigma_k^2$ one usually assumes
that $\Lambda^{-1}(x)$ exists. Necessary and sufficient conditions for the existence of
a strictly stationary solution of (20) and (21) were given by Duan [17] and
Aue, Berkes and Horváth [2]. Under some technical conditions stated in
Hörmann [21] (Lemmas 1, 2 and Remark 2) one can show that augmented
GARCH sequences are weakly $\mathcal{M}$-dependent in $L^p$-norm with exponential
rate.

Note that the above models have short memory; long memory models
(see, e.g., [20]) have completely different properties.

3.6. Linear processes with dependent innovations. Linear processes
$Z_k = \sum_{j=-\infty}^{\infty} a_j Y_{k-j}$ with dependent innovations $\{Y_k\}$ have attracted considerable
interest in the financial literature. Common examples are autoregressive
(AR) processes with augmented GARCH innovations (see, e.g., [27]).

Assume that $\{Y_k, k \in Z\}$ is weakly $\mathcal{M}$-dependent in $L^p$ with rate
function $\delta(\cdot)$. In combination with the results of Section 3.3 one can easily obtain
conditions on $\delta$ assuring that the linear process $\{Z_k, k \in Z\}$ defined above
is also weakly $\mathcal{M}$-dependent in $L^p$-norm with a rate function $\delta^*$ depending
on $(a_j)$ and $\delta$.

Strong approximation results for linear processes with dependent errors
were also obtained by Wu and Min [51].

3.7. Ergodic sums. Let $f$ be a real measurable function with period 1
such that $\int_0^1 f(\omega)\,d\omega = 0$ and $\int_0^1 |f(\omega)|^p\,d\omega < \infty$ for some $p > 2$. Set

$S_n(\omega) = \sum_{k=1}^{n} f(2^k \omega), \qquad \omega \in [0, 1),$

and $B_n^2 = \int_0^1 S_n^2(\omega)\,d\omega$. Then $S_n$ defines a partial sum process on the
probability space $([0, 1), \mathcal{B}_{[0,1)}, \lambda_{[0,1)})$, where $\mathcal{B}_{[0,1)}$ and $\lambda_{[0,1)}$ are the Borel
$\sigma$-algebra and Lebesgue measure on $[0, 1)$. The strong law of large numbers
for $f(2^k \omega)$ is a consequence of the ergodic theorem; for central and functional
central limit theorems see Kac [24], Ibragimov [23] and Billingsley [5].

Let $Y_k(\omega) = f(2^k \omega)$, and define the random variable $\varepsilon_k(\omega)$ to be equal to
the kth digit in the binary expansion of $\omega$. Ambiguity can be avoided by the
convention to take terminating expansions whenever possible. Then $\{\varepsilon_k\}$ is
an i.i.d. sequence, and $\varepsilon_k$ takes the values 0 and 1, each with probability 1/2. This
gives the representation

$Y_k = f\Bigl(\sum_{j=1}^{\infty} \varepsilon_{k+j} 2^{-j}\Bigr) = g(\varepsilon_{k+1}, \varepsilon_{k+2}, \ldots).$

We can now make use of the coupling method described in Section 3.4 and
the approximations

$Y_k^{(m)} = g(\varepsilon_{k+1}, \varepsilon_{k+2}, \ldots, \varepsilon_{k+m}, \varepsilon_{k+m+1}^{(k)}, \varepsilon_{k+m+2}^{(k)}, \ldots).$

Changing for some $\omega \in [0, 1)$ the digits $\varepsilon_k(\omega)$ for $k > m$ will give an $\omega'$ with
$|\omega - \omega'| \le 2^{-m}$. If $f$ is Lipschitz continuous of some order $\gamma$, then we have

$|Y_k - Y_k^{(m)}| = O(2^{-\gamma m}),$

and thus for any $p \ge 1$, $\{Y_k\}$ is weakly $\mathcal{M}$-dependent in $L^p$-norm with an
exponentially decaying rate function.
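
The digit coupling can be illustrated on finitely many binary digits. The following is a minimal sketch, assuming the hypothetical choice $f(x) = \cos(2\pi x)$ (period 1, mean zero, Lipschitz of order 1 with constant $2\pi$) and a truncation of every expansion to `N` digits; none of these choices come from the paper.

```python
import math
import random

def f(x):
    return math.cos(2 * math.pi * x)   # period 1, mean zero, Lipschitz constant 2*pi

def g(digits):
    """f evaluated at the point of [0, 1) whose leading binary digits are `digits`."""
    return f(sum(d * 2.0 ** -(j + 1) for j, d in enumerate(digits)))

rng = random.Random(1)
N, k = 60, 3
eps = [rng.randint(0, 1) for _ in range(N)]          # eps_1, ..., eps_N
errors = []
for m in (2, 5, 10, 20):
    fresh = [rng.randint(0, 1) for _ in range(N)]    # independent copy of the digits
    tail = eps[k:]                                    # eps_{k+1}, eps_{k+2}, ...
    coupled = tail[:m] + fresh[k + m:]                # keep m digits, replace the rest
    errors.append((m, abs(g(tail) - g(coupled))))
```

Because the two arguments of `f` share their first `m` binary digits, they differ by at most $2^{-m}$, so each recorded error is at most $2\pi \cdot 2^{-m}$.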



4. Increments of stochastic processes. For arbitrary $\lambda > 0$, relation (18)
has many useful applications in probability and statistics. For example, it
implies a large class of limit theorems on CLT and LIL behavior and for
various other functionals of weakly dependent sequences. However, many
refined limit theorems for partial sums require a remainder term better
than $O(n^{1/4})$, and no existing method for dependent sequences provides
such a remainder term. The purpose of the next two sections is to show how
to deal with such limit theorems via our approximation results in Section 2.

Let $\{Y_k, k \in Z\}$ be a stationary random sequence, and let $0 < a_n \le n$ be
a nondecreasing sequence of real numbers. In this section, we investigate the
order of magnitude of

$\max_{1 \le k \le n - a_n} \max_{1 \le l \le a_n} \Bigl| \sum_{j=k+1}^{k+l} Y_j \Bigr|.$

Such results have been obtained by Csörgő and Révész [9] for i.i.d. sequences
and the Wiener process. In particular, they obtained the following result ([9],
Theorem 1.2.1).

Theorem 3. Let $\{a_T, T \ge 0\}$ be a positive nondecreasing function
satisfying:

(a) $0 < a_T \le T$;

(b) $T/a_T$ is nondecreasing.

Set

(22)  $\beta_T = \Bigl( 2 a_T \Bigl[ \log \frac{T}{a_T} + \log\log T \Bigr] \Bigr)^{-1/2}.$

Then

$\limsup_{T \to \infty} \max_{0 \le t \le T - a_T} \max_{0 \le s \le a_T} \beta_T |W(t + s) - W(t)| = 1$ a.s.
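
For orientation, the normalizing factor (22) is straightforward to evaluate. The following is a minimal sketch; the function name `beta` is illustrative, and it assumes $T > e$ and $0 < a_T \le T$ so that both logarithms are defined and the bracket is positive.

```python
import math

def beta(T, a_T):
    """(22): beta_T = (2 a_T [log(T / a_T) + log log T]) ** (-1/2)."""
    return (2.0 * a_T * (math.log(T / a_T) + math.log(math.log(T)))) ** -0.5
```

For $a_T = T$ the term $\log(T/a_T)$ vanishes and $\beta_T$ reduces to $(2T \log\log T)^{-1/2}$, the normalization in the law of the iterated logarithm; shorter windows $a_T$ give a larger $\beta_T$.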

Using strong invariance, a similar result can be obtained for partial sums
of i.i.d. random variables under suitable moment conditions (see [9],
pages 115-118). For slowly growing $a_T$, this requires a very good remainder
term in the Wiener approximation of partial sums, using the full power
of the Komlós-Major-Tusnády theorems. As an application of our main
theorems in Section 2, we now extend Theorem 3 to dependent stationary
processes. To simplify the formulation and to clarify the connection between
the remainder term in our approximation theorems in Section 2 and the
increment problem, we introduce the following assumption.

Assumption 1. Let $\{Y_k\}$ be a random sequence which can be redefined
on a new probability space together with two standard Wiener processes
$\{W_1(t), t \ge 0\}$ and $\{W_2(t), t \ge 0\}$ such that

(23)  $\sum_{k=1}^{n} Y_k = W_1(s_n^2) + W_2(t_n^2) + O(E_n)$ a.s.,

where $\{E_n\}$ is some given sequence and $\{s_n\}$ and $\{t_n\}$ are nondecreasing
sequences satisfying

(24)  $s_n^2 \sim \sigma^2 n, \qquad t_n^2 = o(n), \qquad \limsup_{k \to \infty} (s_{k+1}^2 - s_k^2) = \limsup_{k \to \infty} (t_{k+1}^2 - t_k^2) = \sigma^2.$


We will prove the following result. 

Theorem 4. Let $\{Y_k\}$ be a sequence of random variables satisfying
Assumption 1 and put $S_n = \sum_{k=1}^{n} Y_k$. Let $a_T$ be a positive nondecreasing
function such that:

(a) $0 < a_T \le T$;

(b) $T/a_T$ is nondecreasing;

(c) $a_T$ is regularly varying at $\infty$ with index $\varrho \in (0, 1]$.

Let $\beta_T$ be defined by (22). Then under the condition

(25)  $\beta_T E_T = o(1)$

we have

(26)  $\limsup_{n \to \infty} \max_{1 \le k \le n - a_n} \max_{1 \le l \le a_n} \beta_n |S_{k+l} - S_k| = \sigma$ a.s.

Given a function $a_T$ and a weakly $\mathcal{M}$-dependent sequence $\{Y_k\}$ with
parameters $p$, $\delta(\cdot)$, we can compute, using Theorem 4, a rate of decrease for
$\delta(\cdot)$ and a value for $p > 2$ such that the fluctuation result (26) holds. For
example, if $a_T = T^{\alpha}$, $0 < \alpha < 1$, then (26) holds if $p > 4/\alpha$ and
$\delta(m) \ll m^{-p/2}$.


We note that for i.i.d. observations only assumptions (a) and (b) are 
required. It remains open whether a more general version of our Theorem 4 
which does not require assumption (c) can be proved. 

Recently Zholud [53] obtained a distributional version of Theorem 3 by 
showing that the functional 



converges weakly, suitably centered and normalized, to the extremal dis- 
tribution with distribution function . Using this fact and our a.s. in- 
variance principles, a distributional version of Theorem 4 can be obtained 
easily. Since the argument is similar to that for (26), we omit the de- 
tails. 

Let $|A|$ be the cardinality of a set $A$. For the proof of Theorem 4 we need
the following simple lemma.

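Lemma 1. Assume that $\{d_k, k \ge 1\}$ is a nonincreasing sequence of
positive numbers such that $\sum_{k=1}^{\infty} d_k = \infty$. Let $A \subset N$ have positive density, that
is,

$\liminf_{n \to \infty} |A \cap \{1, \ldots, n\}|/n > 0.$

Then $\sum_{k=1}^{\infty} d_k I\{k \in A\} = \infty$.

Proof. First note that by our assumption we have $\sum_{k=1}^{n} I\{k \in A\} \ge \mu n$
for some $\mu > 0$ as long as $n \ge n_0$. Using Abel summation we can write

$\sum_{k=1}^{n} d_k = n d_n + \sum_{k=1}^{n-1} k (d_k - d_{k+1}).$

Hence, by our assumptions

$n d_n + \sum_{k=1}^{n-1} k (d_k - d_{k+1}) \to \infty \qquad (n \to \infty).$

From $d_k - d_{k+1} \ge 0$ it follows (again using the Abel summation) that for
$n \ge n_0$

$\sum_{k=1}^{n} d_k I\{k \in A\} = d_n \sum_{k=1}^{n} I\{k \in A\} + \sum_{k=1}^{n-1} (d_k - d_{k+1}) \sum_{j=1}^{k} I\{j \in A\}$

$\ge \mu \Bigl( n d_n + \sum_{k=n_0}^{n-1} k (d_k - d_{k+1}) \Bigr) \to \infty \qquad (n \to \infty).$ □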
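
The two Abel summation identities used in the proof can be verified numerically. The following is a minimal sketch, assuming the hypothetical choices $d_k = 1/k$ and $A$ = multiples of 3 (density 1/3); the function names are illustrative.

```python
def abel_rhs(d):
    """n*d_n + sum_{k=1}^{n-1} k*(d_k - d_{k+1}) for d = [d_1, ..., d_n]."""
    n = len(d)
    return n * d[-1] + sum(k * (d[k - 1] - d[k]) for k in range(1, n))

def weighted_abel_rhs(d, indicator):
    """d_n*N_n + sum_{k<n} (d_k - d_{k+1})*N_k, where N_k = #{j <= k : j in A}."""
    n = len(d)
    counts, running = [], 0
    for x in indicator:
        running += x
        counts.append(running)
    return d[-1] * counts[-1] + sum(
        (d[k - 1] - d[k]) * counts[k - 1] for k in range(1, n)
    )

d = [1.0 / k for k in range(1, 201)]                       # nonincreasing, divergent sum
in_A = [1 if k % 3 == 0 else 0 for k in range(1, 201)]     # indicator of A
```

Both right-hand sides agree with the direct sums $\sum d_k$ and $\sum d_k I\{k \in A\}$ up to rounding, which is exactly what the proof exploits before bounding $N_k$ from below by $\mu k$.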

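Proof of Theorem 4. For the sake of simplicity we carry out the
proof for $\sigma = 1$. From (23) and the triangular inequality we infer that

$\limsup_{n \to \infty} \max_{1 \le k \le n - a_n} \max_{1 \le l \le a_n} \beta_n |S_{k+l} - S_k|$

$\le \limsup_{n \to \infty} \max_{1 \le k \le n - a_n} \max_{1 \le l \le a_n} \beta_n |W_1(s_{k+l}^2) - W_1(s_k^2)|$

$+ \limsup_{n \to \infty} \max_{1 \le k \le n - a_n} \max_{1 \le l \le a_n} \beta_n |W_2(t_{k+l}^2) - W_2(t_k^2)| + \limsup_{n \to \infty} O(\beta_n E_n)$

$= A_1 + A_2 + A_3.$

By (25) $A_3 = 0$. Since $a_n \to \infty$ [this is implicit in (c)], we conclude from (24)
that for any $\varepsilon > 0$ some $n_0$ exists, such that for all $n \ge n_0$

$\sup_{k \ge 1} \{s_{k+a_n}^2 - s_k^2\} \le (1 + \varepsilon) a_n$ and $s_n^2 \le (1 + \varepsilon) n.$

Set $T = (1 + \varepsilon) n$, and define $a_{T,\varepsilon} = (1 + \varepsilon) a_{T/(1+\varepsilon)}$. Then $a_{T,\varepsilon}$ satisfies (a)
and (b) and for $n \ge n_0$ we have

$\max_{1 \le k \le n - a_n} \max_{1 \le l \le a_n} \beta_n |W_1(s_{k+l}^2) - W_1(s_k^2)|$

$\le \sup_{0 \le t \le s_{n-a_n}^2} \sup_{0 \le s \le (1+\varepsilon) a_n} \beta_n |W_1(t + s) - W_1(t)|$

$\le \sup_{0 \le t \le T - a_{T,\varepsilon}} \sup_{0 \le s \le a_{T,\varepsilon}} \beta_{T/(1+\varepsilon)} |W_1(t + s) - W_1(t)|.$

Let

$\beta_{T,\varepsilon} = \Bigl( 2 a_{T,\varepsilon} \Bigl[ \log \frac{T}{a_{T,\varepsilon}} + \log\log T \Bigr] \Bigr)^{-1/2}.$

By application of Theorem 1.2.1 in Csörgő and Révész [9] [which requires (a)
and (b)] we get

$\limsup_{T \to \infty} \sup_{0 \le t \le T - a_{T,\varepsilon}} \sup_{0 \le s \le a_{T,\varepsilon}} \beta_{T,\varepsilon} |W_1(t + s) - W_1(t)| = 1$ a.s.

Since $\lim_{T \to \infty} \beta_{T/(1+\varepsilon)} / \beta_{T,\varepsilon} = (1 + \varepsilon)^{1/2}$ and $\varepsilon$ can be chosen arbitrarily
small, we have shown that $A_1 \le 1$ a.s.

It is not surprising that due to (24) similar arguments will lead to $A_2 = 0$
a.s.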

The proof will be completed if we show that $A_1 \ge 1$. Let $\{n_k\}$ be a
nondecreasing sequence of integers with $n_k \to \infty$. By (23), the triangular inequality
and $A_2 = A_3 = 0$ we obtain

$\limsup_{n \to \infty} \max_{1 \le k \le n - a_n} \max_{1 \le l \le a_n} \beta_n |S_{k+l} - S_k|$

$\ge \limsup_{n \to \infty} \max_{1 \le k \le n - a_n} \max_{1 \le l \le a_n} \beta_n |W_1(s_{k+l}^2) - W_1(s_k^2)|$

$\ge \limsup_{k \to \infty} \beta_{n_k} |W_1(s_{n_k}^2) - W_1(s_{n_k - a_{n_k}}^2)|.$

We now proceed similarly as in Csörgő and Révész [9] for the proof of Step 2
of their Theorem 1.2.1. We will distinguish between the cases $\lim_{T \to \infty} a_T/T = \rho$
with $\rho < 1$ and $\rho = 1$. Since both times we can use the same conceptual idea,
we shall treat here only $\rho < 1$.

Set $n_1 = 1$. Given $n_k$, define $n_{k+1}$ such that $n_{k+1} - a_{n_{k+1}} = n_k$. This
equation will, in general, have no integer solutions, but for the sake of simplicity
we assume that $(n_k)$ and $(a_{n_k})$ are $Z$-valued. Since $(s_n)$ is nondecreasing, we
conclude that the increments $\Delta(k) = W_1(s_{n_k}^2) - W_1(s_{n_{k-1}}^2)$ are independent.
By the second Borel-Cantelli lemma it suffices to show now that

(27)  $\sum_{k=1}^{\infty} P(\beta_{n_k} |\Delta(k)| \ge 1 - \varepsilon) = \infty$ for all $\varepsilon > 0$.


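For all large enough $k \in N$ for which $s_{n_k}^2 - s_{n_{k-1}}^2 \ge (1 - \varepsilon/2) a_{n_k}$, the estimates
in [9] give

$P(\beta_{n_k} |\Delta(k)| \ge 1 - \varepsilon) \ge \Bigl( \frac{a_{n_k}}{n_k \log n_k} \Bigr)^{1-\varepsilon}.$

It is also shown in [9] that $\sum_{k=1}^{\infty} \bigl( a_{n_k} / (n_k \log n_k) \bigr)^{1-\varepsilon} = \infty$. Thus, in view of Lemma 1
it remains to show that $A = \{k \ge 1 \mid s_{n_k}^2 - s_{n_{k-1}}^2 \ge (1 - \varepsilon/2) a_{n_k}\}$ has a positive
density. By (24) we have

$\sum_{j=2}^{k} (s_{n_j}^2 - s_{n_{j-1}}^2)/n_k \le C_0 \sum_{2 \le j \le k,\ j \in A} (n_j - n_{j-1})/n_k + \sum_{2 \le j \le k,\ j \notin A} (1 - \varepsilon/2)(n_j - n_{j-1})/n_k$
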






I. BERKES, S. HORMANN AND J. SCHAUER 



$\le C_0 \sum_{2 \le j \le k,\ j \in A} (n_j - n_{j-1})/n_k + (1 - n_1/n_k)(1 - \varepsilon/2)$

for some $C_0 > 0$ which is independent of $k$. Now if $A$ had density zero,
the limsup of the right-hand side of the last relation would be $1 - \varepsilon/2$.
This can be easily proved, using that $(n_j - n_{j-1})$ is regularly varying by
assumption (c). The liminf of the left-hand side above is 1. Thus $A$ must
have positive density and the proof is complete. □

5. Change-point tests with an epidemic alternative. In this section we
apply our invariance principles to a change-point problem. Let $\{Y_k, k \in Z\}$
be a zero mean process. Further let $X_k = Y_k + \mu_k$, where $\mu_k$, $k \in Z$, are
unknown constants. We want to test the hypothesis

($H_0$)  $\mu_1 = \mu_2 = \cdots = \mu_n = \mu$

against the "epidemic alternative"

($H_A$)  There exist $1 \le m_1 < m_2 \le n$ such that $\mu_k = \mu$ for
$k \in \{1, \ldots, n\} \setminus \{m_1 + 1, \ldots, m_2\}$ and $\mu_k = \mu + \Delta$
if $k \in \{m_1 + 1, \ldots, m_2\}$.

It should be noted that the variables $m_1$, $m_2$ and $\Delta$ may depend on the
sample size $n$. As is common in the change-point literature, this dependence
is suppressed in the notation.

Without loss of generality we assume that a = 1. To detect a possible 
epidemic change it is natural to compare the increments of the process to 
a proportion of the total sum. More specifically, assume for the moment 
that $X_k$ are independent and that we know when the epidemic starts and
ends. Set $S_k = X_1 + \cdots + X_k$. Then by the law of large numbers
$I(m_1, m_2) = |S_{m_2} - S_{m_1} - (m_2 - m_1) S_n/n| \approx m_2 - m_1$. If no change occurs, however, by
the central limit theorem $I(m_1, m_2) = O_P(\sqrt{m_2 - m_1})$. In general we do not
know $m_1$ and $m_2$. Thus, a natural test statistic is

$\max_{1 \le i < j \le n} |S_j - S_i - (j - i) S_n/n|.$


Clearly we are required to normalize the above test statistic appropriately.
Following Račkauskas and Suquet [36] we define

$$UI(n,\alpha) = n^{-1/2}\max_{1\le i<j\le n}\frac{|S_j - S_i - (j-i)S_n/n|}{[((j-i)/n)(1-(j-i)/n)]^{\alpha}}$$

with $0 < \alpha < 1/2$. As we will see below, the parameter $\alpha$ plays an important
role. The closer $\alpha$ is to 1/2, the "shorter" epidemics can be detected with
this test. The price, however, is that obtaining the limiting law
under $(H_0)$ with "large" $\alpha$ (close to 1/2) requires a.s. invariance principles
with error $n^{\varepsilon}$, $\varepsilon$ close to zero. Choosing $\alpha \ge 1/2$ would result in a degenerate
limiting distribution under $(H_0)$.
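
To make the definition concrete, the following is a minimal brute-force sketch of the statistic $UI(n,\alpha)$, computed directly from the formula above over all pairs $1 \le i < j \le n$. The function name `ui_statistic` and the toy data are illustrative assumptions and are not taken from the paper; no normalization by $\sigma$ is applied.

```python
def ui_statistic(x, alpha):
    """Brute-force UI(n, alpha) for observations x_1..x_n (O(n^2) pairs).

    Hypothetical helper: implements
        n^{-1/2} max_{1<=i<j<=n} |S_j - S_i - (j-i) S_n / n|
                                 / [((j-i)/n)(1-(j-i)/n)]^alpha
    exactly as written, without dividing by sigma.
    """
    n = len(x)
    # Prefix sums: s[0] = 0 and s[k] = x_1 + ... + x_k.
    s = [0.0]
    for value in x:
        s.append(s[-1] + value)
    best = 0.0
    for i in range(1, n):
        for j in range(i + 1, n + 1):
            h = (j - i) / n                      # relative window length, 0 < h < 1
            weight = (h * (1.0 - h)) ** alpha    # strictly positive since j - i <= n - 1
            numerator = abs(s[j] - s[i] - (j - i) * s[n] / n)
            best = max(best, numerator / weight)
    return best / n ** 0.5


if __name__ == "__main__":
    # Toy illustration: an alternating sequence, with and without a shifted window.
    base = [1.0 if k % 2 == 0 else -1.0 for k in range(100)]
    shifted = [v + 2.0 if 40 <= k < 60 else v for k, v in enumerate(base)]
    print(ui_statistic(base, 0.25), ui_statistic(shifted, 0.25))
```

The quadratic loop is chosen for transparency only; for long series one would restrict or vectorize the search over windows.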
Proposition 2 [Asymptotics under $(H_0)$]. If the stationary sequence
$\{Y_k, k \in \mathbb{Z}\}$ satisfies Assumption 1 with $E_n = o(n^{1/2-\alpha})$ and $(H_0)$ holds,
then

$$\sigma^{-1}UI(n,\alpha) \xrightarrow{\mathcal{D}} \sup_{0<s<t<1}\frac{|B(t)-B(s)|}{[(t-s)(1-(t-s))]^{\alpha}},$$

where $\{B(t), t \in [0,1]\}$ is a Brownian bridge.

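Proof. Using (23) and assuming for simplicity that $\sigma = 1$, we obtain

$$\begin{aligned}
UI(n,\alpha) &\le n^{-1/2}\max_{1\le i<j\le n}\frac{|W_1(s_j^2)-W_1(s_i^2)-(j-i)W_1(s_n^2)/n|}{[((j-i)/n)(1-(j-i)/n)]^{\alpha}} \\
&\quad + n^{-1/2}\max_{1\le i<j\le n}\frac{|W_2(t_j^2)-W_2(t_i^2)-(j-i)W_2(t_n^2)/n|}{[((j-i)/n)(1-(j-i)/n)]^{\alpha}} + o(1) \\
&=: n^{-1/2}\max_{1\le i<j\le n}T^{(1)}_{i,j} + n^{-1/2}\max_{1\le i<j\le n}T^{(2)}_{i,j} + o(1).
\end{aligned}$$

It is easy to see that $n^{-1/2}\max_{1\le i<j\le n}T^{(2)}_{i,j}$ tends to zero. Since we can get
a similar lower bound for $UI(n,\alpha)$, we have

$$UI(n,\alpha) = n^{-1/2}\max_{(i,j)\in\mathcal{M}_n}T^{(1)}_{i,j} + o_P(1),$$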

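where $\mathcal{M}_n = \{(i,j) \mid 1 \le i < j \le n\}$. Let us partition $\mathcal{M}_n$ into

$$\mathcal{M}_{1,n} = \{(i,j) \mid 1 \le i < j \le n;\ n\gamma_n \le j-i \le n(1-\gamma_n)\},$$
$$\mathcal{M}_{2,n} = \{(i,j) \mid 1 \le i < j \le n;\ n\gamma_n > j-i\}$$

and

$$\mathcal{M}_{3,n} = \{(i,j) \mid 1 \le i < j \le n;\ j-i > n(1-\gamma_n)\},$$

where $\gamma_n \to 0$ will be defined later. By our assumptions on the sequence $\{s_j^2\}$
there exists a $\tau > 0$ such that $s_j^2 - s_i^2 \le \tau(j-i)$ for all $1 \le i < j$ and that
$s_n^2 \le (2-\tau\gamma_n)n$ if $n \ge n_0$. We have for large enough $n$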

-1/2 rp(l) 

re ' max J - • 

(i,i)6A42.„ 

<2„"-/^ „,ax sup sup (m±fLl^U„,(i). 
For arbitrary $\varepsilon > 0$ we get by Lemma 1.2.1 in Csörgő and Révész [9] that
there is a constant $C$ which is independent of $n$ and $\varepsilon$ such that

P\ max sup sup <; — > > en ' 



max sup sup <; — 

l</i<n7„o<(<(2_r7„)nO<s<r/i I h"" 

<^P( sup sup \Wi{t + s)-Wi{t)\>eh^'^{n/hYl'^^'' 



h=l 

nSn 



h=l 

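Hence $n^{-1/2}\max_{(i,j)\in\mathcal{M}_{2,n}}T^{(1)}_{i,j} = o_P(1)$. In the same fashion one can show
that $n^{-1/2}\max_{(i,j)\in\mathcal{M}_{3,n}}T^{(1)}_{i,j} = o_P(1)$. Therefore

$$UI(n,\alpha) = n^{-1/2}\max_{(i,j)\in\mathcal{M}_{1,n}}T^{(1)}_{i,j} + o_P(1).$$

Some further basic estimates give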

-1/2 rpil) 

n ' max i ,■ ■ 

= n"i/2 max - ^i(^) - - 

(ijSi.n [((j-i)/n)(l-(j-f)/n)]" 



n 



-1/2 



2^ 



+ 0^max|t^i(i)-t^i(., 

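Since $s_n^2 \sim n$ there is a null sequence $\{\varepsilon_n\}$ such that $\max_{1\le i\le n}|i - s_i^2| \le \varepsilon_n n$.
Hence

$$\max_{1\le i\le n}|W_1(i) - W_1(s_i^2)| \le \sup_{0\le t\le n}\ \sup_{0\le s\le 2\varepsilon_n n}|W_1(t+s) - W_1(t)|.$$

Setting $\gamma_n = \varepsilon_n$ and applying again Lemma 1.2.1 in [9] it can be seen that

$$\sup_{0\le t\le n}\ \sup_{0\le s\le 2\varepsilon_n n}|W_1(t+s) - W_1(t)| = o_P(n^{1/2}\gamma_n^{\alpha}).$$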

Consequently

$$UI(n,\alpha) = n^{-1/2}\max_{(i,j)\in\mathcal{M}_{1,n}}\frac{|W_1(j)-W_1(i)-(j-i)W_1(n)/n|}{[((j-i)/n)(1-(j-i)/n)]^{\alpha}} + o_P(1). \tag{28}$$

Since the line of argumentation is very similar to what we have shown before,
we note now without proof that $\mathcal{M}_{1,n}$ on the right-hand side of (28) can be
replaced by $\mathcal{M}_n$. The rest of the proof of Proposition 2 is standard. □
The next proposition shows that this test is consistent. Let $\ell = m_2 - m_1$
denote the length of the epidemic.

Proposition 3 [Asymptotics under $(H_A)$]. Let $\{Y_k, k \in \mathbb{Z}\}$ be a mean
zero process, weakly $\mathcal{M}$-dependent in $L^p$ with $p \ge 2$ and $\delta(\cdot)$ satisfying

$$\sum_{m\ge 1}\delta(m) < \infty.$$

Let $X_k = Y_k + \mu_k$, $k \in \mathbb{Z}$. Assume that $(H_A)$ holds and that

$$\lim_{n\to\infty} n^{1/2}\bigl((\ell/n)(1-\ell/n)\bigr)^{1-\alpha}|\Delta| = \infty. \tag{29}$$

Then $UI(n,\alpha) \xrightarrow{P} \infty$.

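Proof. Under the alternative hypothesis $(H_A)$ we have $X_k = Y_k + \mu$
for $k \in \{1,\ldots,n\}\setminus\{m_1+1,\ldots,m_2\}$ and $X_k = Y_k + \mu + \Delta$ for $k \in \{m_1+1,\ldots,m_2\}$.
To find a lower bound for $UI(n,\alpha)$ we study the numerator of
the test statistic corresponding to the true epidemic. Thus we look at

$$\begin{aligned}
S_{m_2} - S_{m_1} - S_n(m_2/n - m_1/n)
&= (1-\ell/n)(S_{m_2}-S_{m_1}) - (\ell/n)\bigl(S_n - (S_{m_2}-S_{m_1})\bigr) \\
&= \ell(1-\ell/n)\Delta + (1-\ell/n)\sum_{j=m_1+1}^{m_2}Y_j - (\ell/n)\Bigl(\sum_{j=1}^{m_1}Y_j + \sum_{j=m_2+1}^{n}Y_j\Bigr) \\
&=: \ell(1-\ell/n)\Delta + R_n.
\end{aligned}$$

With the help of the moment inequality stated in Proposition 4 below we
get

$$\begin{aligned}
\mathrm{Var}(n^{-1/2}R_n) &= O\bigl((1-\ell/n)^2(\ell/n) + (\ell/n)^2(1-\ell/n) + 2(\ell/n)^{3/2}(1-\ell/n)^{3/2}\bigr) \\
&= O\bigl((\ell/n)(1-\ell/n)\bigr),
\end{aligned}$$

and thus $n^{-1/2}R_n = O_P\bigl((\ell/n)^{1/2}(1-\ell/n)^{1/2}\bigr)$. Thus we have shown that

$$\begin{aligned}
UI(n,\alpha) &\ge n^{1/2}\bigl((\ell/n)(1-\ell/n)\bigr)^{1-\alpha}|\Delta| - O_P\bigl(((\ell/n)(1-\ell/n))^{1/2-\alpha}\bigr) \\
&= \frac{(\ell(n-\ell))^{1-\alpha}}{n^{3/2-2\alpha}}|\Delta| - O_P\bigl(((\ell/n)(1-\ell/n))^{1/2-\alpha}\bigr).
\end{aligned} \tag{30}$$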
To conclude the proof we note that $\lim_{n\to\infty}((\ell/n)(1-\ell/n))^{1/2-\alpha} = 0$ if
$\ell = o(n)$ [or $n - \ell = o(n)$, resp.] and $((\ell/n)(1-\ell/n))^{1/2-\alpha} \le 1$ in general.
Consequently condition (29) together with relation (30) finishes the proof.
□

For example, if $\Delta$ is independent of $n$, then condition (29) will hold for
$\ell \sim cn$, $c \in (0,1)$. In case that $\ell \sim n^{\nu}$ [$n - \ell \sim n^{\nu}$], $\nu > 0$, condition (29) holds
provided that $(1-2\alpha)/(1-\alpha) < 2\nu$. That is, choosing $\alpha$ close to 1/2 allows
us to detect relatively "short" ("long") epidemics.
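
As a worked check of the last claim, the exponent condition can be derived in two lines. This is a sketch under stated assumptions: condition (29) is read as requiring the leading term of (30), $n^{1/2}((\ell/n)(1-\ell/n))^{1-\alpha}|\Delta|$, to tend to infinity, $\Delta$ is held fixed, and $\ell = n^{\nu}$ with $0 < \nu < 1$.

```latex
\begin{align*}
n^{1/2}\Bigl(\tfrac{\ell}{n}\bigl(1-\tfrac{\ell}{n}\bigr)\Bigr)^{1-\alpha}
  &\sim n^{1/2}\, n^{-(1-\nu)(1-\alpha)}
   = n^{\,1/2-(1-\nu)(1-\alpha)}, \\
\tfrac12-(1-\nu)(1-\alpha) > 0
  &\iff 1-\nu < \frac{1}{2(1-\alpha)}
   \iff \frac{1-2\alpha}{1-\alpha} < 2\nu .
\end{align*}
```

For instance, $\alpha = 0$ gives $\nu > 1/2$, while $\alpha = 1/4$ relaxes the requirement to $\nu > 1/3$, in line with the remark that larger $\alpha$ detects shorter epidemics.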



6. Proof of the main theorems. 



6.1. A moment inequality. In the proofs of our theorems we will use the 
following moment inequality which may be of separate interest. 

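Proposition 4. Let $\{Y_k, k \in \mathbb{Z}\}$ be a centered stationary sequence, weakly
$\mathcal{M}$-dependent in $L^p$ with $p \ge 2$ and a rate function $\delta(\cdot)$ satisfying

$$D_p := \sum_{m=0}^{\infty}\delta(m) < \infty.$$

Then for any $n \in \mathbb{N}$, $b \in \mathbb{Z}$ we have

$$E\Bigl|\sum_{k=b+1}^{b+n}Y_k\Bigr|^p \le C_p n^{p/2}, \tag{31}$$

where $C_p$ is a constant depending on $p$ and the sequence $\{Y_k\}$.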

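Proof. By stationarity, we can assume $b = 0$. Let first $p = 2$. We use
below that $\sup_{m\ge 0}\|Y_k^{(m)}\|_p \le \|Y_1\|_p + D_p$. Without loss of generality we
assume that $EY_k^{(m)} = 0$ for all $k \in \mathbb{Z}$ and $m \in \mathbb{N}$. Since

$$Y_kY_{k+j} = (Y_k - Y_k^{(j-1)})Y_{k+j} + Y_k^{(j-1)}(Y_{k+j} - Y_{k+j}^{(j-1)}) + Y_k^{(j-1)}Y_{k+j}^{(j-1)},$$

we get by assumption (B) that for $j \ge 1$

$$\begin{aligned}
|EY_kY_{k+j}| &\le |E[(Y_k - Y_k^{(j-1)})Y_{k+j}]| + |E[Y_k^{(j-1)}(Y_{k+j} - Y_{k+j}^{(j-1)})]| \\
&\le \|Y_{k+j}\|_2\|Y_k - Y_k^{(j-1)}\|_2 + \|Y_k^{(j-1)}\|_2\|Y_{k+j} - Y_{k+j}^{(j-1)}\|_2 \\
&\le (\|Y_{k+j}\|_2 + \|Y_k^{(j-1)}\|_2)\,\delta(j-1) \\
&\le (2\|Y_1\|_2 + D_2)\,\delta(j-1).
\end{aligned} \tag{32}$$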
From relation (32) we infer, letting $S_n = \sum_{k=1}^{n}Y_k$,

$$\begin{aligned}
ES_n^2 &= \sum_{k=1}^{n}EY_k^2 + 2\sum_{1\le k<l\le n}EY_kY_l \\
&\le n\|Y_1\|_2^2 + 2\Bigl[\sum_{1\le k\le n-1}|EY_kY_{k+1}| + \cdots + \sum_{1\le k\le 2}|EY_kY_{k+n-2}| + |EY_1Y_n|\Bigr] \\
&\le n\|Y_1\|_2^2 + 2(2\|Y_1\|_2 + D_2)[(n-1)\delta(0) + \cdots + 2\delta(n-3) + \delta(n-2)] \\
&\le n\bigl(\|Y_1\|_2^2 + 2D_2(2\|Y_1\|_2 + D_2)\bigr) =: C_2n.
\end{aligned}$$

This shows (31) for $p = 2$.
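
The structure of the $p = 2$ argument (diagonal terms plus summable off-diagonal covariances give a bound linear in $n$) can be illustrated numerically. The sketch below is not the paper's construction: it assumes a hypothetical finite moving average $Y_k = \sum_i c_i e_{k-i}$ with i.i.d. unit-variance innovations, and it uses the exact autocovariances $\gamma(j)$ in place of the bounds $(2\|Y_1\|_2 + D_2)\delta(j-1)$ from (32). All function names are illustrative.

```python
def ma_autocov(coeffs):
    """Autocovariances gamma(0..q) of Y_k = sum_i coeffs[i] * e_{k-i},
    assuming i.i.d. innovations e with mean 0 and variance 1 (hypothetical model)."""
    q = len(coeffs) - 1
    return [sum(coeffs[i] * coeffs[i + j] for i in range(q + 1 - j))
            for j in range(q + 1)]


def var_partial_sum(gamma, n):
    """Exact Var(Y_1 + ... + Y_n) = n*gamma(0) + 2*sum_{j>=1} (n-j)*gamma(j)."""
    total = n * gamma[0]
    for j in range(1, min(n, len(gamma))):
        total += 2 * (n - j) * gamma[j]
    return total


def linear_bound(gamma, n):
    """Linear bound n*(gamma(0) + 2*sum_{j>=1}|gamma(j)|), mirroring E S_n^2 <= C_2 n."""
    return n * (gamma[0] + 2 * sum(abs(g) for g in gamma[1:]))


if __name__ == "__main__":
    gamma = ma_autocov([1.0, 0.5, -0.25])
    for n in (1, 10, 100):
        print(n, var_partial_sum(gamma, n), linear_bound(gamma, n))
```

The constant in `linear_bound` plays the role of $C_2$; it depends on the sequence only through sums of covariance bounds, which is the point of the estimate.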

Once (31) is established for $p$, it holds for all $0 < q < p$. Indeed, by Lyapunov's
inequality, relation (31) implies

$$E\Bigl|\sum_{k=b+1}^{b+n}Y_k\Bigr|^q \le C_p^{q/p}n^{q/2} \tag{33}$$

for any $0 < q < p$. In particular, (31) holds with $p = 1$.

Next we prove (31) for all integers $p \ge 2$. Clearly, if $C_p \ge \|S_1\|_p^p$, then the
inequality

$$E|S_n|^p \le C_pn^{p/2} \tag{34}$$

holds for $n = 1$. Using a double induction argument, we show now that for
some constant $C_p$, relation (34) holds for all $n \in \mathbb{N}$. More precisely, we show
that if (34) holds for $p - 1$ and all $n \in \mathbb{N}$ and also for $p$ and $n \le n_0$, then it
will also hold for $p$ and $n \le 2n_0$.

For $k \le n$ put $S_k^n = Y_k + Y_{k+1} + \cdots + Y_n$. We have



E\S2n\^ — E\Sn + S^li F 



E 



n "T ^n+ll 
n 



(fc-i), 

+k > 



k=l 



k=l 



n n 

E^f'+E^-k'' 

k=l k=l 



(35) 



(fc-i)i 



\k=l 



+ 



k=l 



+k 



k=l 



k=l 






< 



2Dp + 



En 

k=l 



+k 



k=l 



(36) 



--■.i2Dp+\\Zn+Wn\\pr. 



For some positive constants $\psi_p$ that will be specified later, we choose $C_p$ so
that $C_p^{1/p} \ge D_p/\psi_p$. Then if $n \le n_0$

$$\begin{aligned}
E|Z_n|^p &\le (\|S_n\|_p + \|S_n - Z_n\|_p)^p \\
&\le (\|S_n\|_p + D_p)^p \\
&\le (1+\psi_p)^pC_pn^{p/2}.
\end{aligned}$$

By the induction assumption, this relation holds with arbitrary $n$ for all
integer moments of order $\le p - 1$. The same estimate applies for $W_n$.
Due to assumption (B) in Definition 1, the random variables $Z_n$ and $W_n$ are
independent. Thus



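$$\begin{aligned}
E|Z_n + W_n|^p &\le E|Z_n|^p + E|W_n|^p + \sum_{m=1}^{p-1}\binom{p}{m}E|Z_n|^mE|W_n|^{p-m} \\
&\le n^{p/2}\Bigl[2(1+\psi_p)^pC_p + \sum_{m=1}^{p-1}\binom{p}{m}(1+\psi_m)^m(1+\psi_{p-m})^{p-m}C_mC_{p-m}\Bigr] \\
&=: n^{p/2}[2(1+\psi_p)^pC_p + R_p].
\end{aligned} \tag{37}$$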



Hence (36) and (37) and our assumptions on $C_p$ imply that

$$E|S_{2n}|^p \le C_pn^{p/2}\bigl(2\psi_p + [2(1+\psi_p)^p + R_p/C_p]^{1/p}\bigr)^p. \tag{38}$$

Choosing $\psi_p$ small enough, and then choosing $C_p$ large enough, we can
always achieve that the term in brackets of (38) is $\le \sqrt{2}$, provided that $p > 2$,
and that the inequality $C_p^{1/p} \ge D_p/\psi_p$ mentioned before is satisfied. Hence
we have for every $n \le n_0$ that $E|S_{2n}|^p \le C_p(2n)^{p/2}$, proving (34) for all even
numbers $n \le 2n_0$. The case of odd $n$ is similar. The proof of Proposition 4
is finished for integer $p$.

For general $p > 2$ we have by the result shown before that (31) holds
for $\lfloor p\rfloor$. (As usual, $\lfloor p\rfloor$ denotes the integer part of the real number $p$.) To
finish the proof we need the following inequality which will be proven below:

$$|a+b|^p \le |a|^p + |b|^p + \sum_{k=1}^{\lfloor p\rfloor}\binom{p}{k}\bigl(|a|^k|b|^{p-k} + |b|^k|a|^{p-k}\bigr), \qquad p \in [1,\infty). \tag{39}$$

Using (39) we get a similar estimate for $E|Z_n + W_n|^p$ as in (37) and the
proof can be finished along the same lines as for integer $p$.

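Verification of (39): Let $x \in [0,1]$. We recall that $(1+x)^p$ can be expanded
in the binomial series

$$(1+x)^p = \sum_{k\ge 0}\binom{p}{k}x^k$$

with

$$\binom{p}{k} = \frac{p(p-1)\cdots(p-k+1)}{k!}. \tag{40}$$

From (40) it is clear that for $k \ge \lfloor p\rfloor + 2$ we have $\mathrm{sign}\bigl\{\binom{p}{k}\bigr\} = (-1)^{k-\lfloor p\rfloor+1}$.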
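This immediately yields for $k = \lfloor p\rfloor + 2i$ with $i \ge 1$,

$$\binom{p}{k} + \binom{p}{k+1} = \binom{p+1}{k+1} \le 0.$$

Consequently

$$\sum_{k\ge\lfloor p\rfloor+2}\binom{p}{k}x^k \le 0$$

and

$$(1+x)^p \le \sum_{k=0}^{\lfloor p\rfloor+1}\binom{p}{k}x^k. \tag{41}$$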



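Now consider $|a+b|^p$. If $|a| \ge |b|$, then we infer from (41) that

$$\begin{aligned}
|a+b|^p &\le |a|^p\Bigl(1 + \Bigl|\frac{b}{a}\Bigr|\Bigr)^p \le |a|^p\sum_{k=0}^{\lfloor p\rfloor+1}\binom{p}{k}\Bigl|\frac{b}{a}\Bigr|^k \\
&= |a|^p + \sum_{k=1}^{\lfloor p\rfloor}\binom{p}{k}|b|^k|a|^{p-k} + \binom{p}{\lfloor p\rfloor+1}\Bigl|\frac{b}{a}\Bigr|^{\lfloor p\rfloor+1-p}|b|^p.
\end{aligned}$$

Thus (39) follows from $\binom{p}{\lfloor p\rfloor+1}\bigl|\frac{b}{a}\bigr|^{\lfloor p\rfloor+1-p} \le 1$. Interchanging the roles of $a$
and $b$ completes the proof. □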
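
Inequality (39) is easy to sanity-check numerically. The sketch below assumes that (39) reads $|a+b|^p \le |a|^p + |b|^p + \sum_{k=1}^{\lfloor p\rfloor}\binom{p}{k}(|a|^k|b|^{p-k} + |b|^k|a|^{p-k})$ with the generalized binomial coefficient from (40); the helper names are hypothetical, and the random search is a spot check rather than a proof.

```python
import math
import random


def gen_binom(p, k):
    """Generalized binomial coefficient p(p-1)...(p-k+1)/k! for real p, integer k >= 0."""
    out = 1.0
    for j in range(k):
        out *= (p - j) / (j + 1)
    return out


def rhs_39(a, b, p):
    """Right-hand side of (39) in the form assumed in the lead-in."""
    a, b = abs(a), abs(b)
    total = a ** p + b ** p
    for k in range(1, math.floor(p) + 1):
        total += gen_binom(p, k) * (a ** k * b ** (p - k) + b ** k * a ** (p - k))
    return total


def holds_39(a, b, p, tol=1e-9):
    """True if |a+b|^p <= RHS of (39), up to a small floating-point tolerance."""
    return abs(a + b) ** p <= rhs_39(a, b, p) * (1.0 + tol) + tol


if __name__ == "__main__":
    rng = random.Random(0)
    samples = [(rng.uniform(-5, 5), rng.uniform(-5, 5), rng.uniform(1, 6))
               for _ in range(10000)]
    print(all(holds_39(a, b, p) for a, b, p in samples))
```

The tolerance matters because (39) holds with equality when $b = 0$ and $p$ is not an integer.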



Using Móricz [33], Theorem 1, we get:

Corollary 1. Under the assumptions of Proposition 4 with $p > 2$, we
have for any $2 < q \le p$ and any $n \in \mathbb{N}$, $b \in \mathbb{Z}$

$$E\max_{1\le k\le n}\Bigl|\sum_{j=b+1}^{b+k}Y_j\Bigr|^q \le C_{p,q}\,n^{q/2},$$

where the constants $C_{p,q}$ only depend on $p$, $q$ and the sequence $\{Y_k\}$.

A slightly weaker result can also be derived from Proposition 4 for the
case of $0 < q \le 2$.

6.2. Proofs of Theorems 1 and 2. We give the proof of Theorem 1. Note
first of all that $\delta(m) = \|Y_k - Y_k^{(m)}\|_p \ge \|Y_k - Y_k^{(m)}\|_2$, and consequently (32)
holds when the $L^2$-norm is replaced by the $L^p$-norm. Since $A > 1$ in (9), we
infer that the series in (10) is absolutely convergent.

Let us specify some constants that will be used for the proof. By our
assumption on $A$ it is possible to find a constant $0 < \varepsilon_0 < 1/2$ such that


A> 



2r/(l-eo)' 



1 



Then we set

$$\delta = \frac{\beta}{1+\alpha}, \qquad \beta = (1-\varepsilon_0)\alpha, \qquad\text{with } \alpha := \frac{2\eta(1-\varepsilon_0)}{p-2(1+\eta)}. \tag{42}$$

For some $\varepsilon_1 > 0$ (which will be specified later) we now define $m_k = \lfloor\varepsilon_1k^{\delta}\rfloor$.
The first step in the proof of (11) is to show that it is sufficient to provide
the strong approximation for the perturbed sequence $Y_k' = Y_k^{(m_k)}$. We notice
that our main assumption (8) yields $\|Y_k - Y_k'\|_p \ll k^{-A\delta}$. If $A\delta < 1$, then



P 



max 

2"<fc<2"+i 



> 



]_2in/p){i+n) 



n 



< p^'^ \Yj - y/i > i2("/p)(i+^)^ 
where $c_1 = (1+\eta) - (1-A\delta)p > 0$. Thus by the Borel-Cantelli lemma we
have

$$\sum_{j=1}^{k}Y_j = \sum_{j=1}^{k}Y_j' + o(k^{(1+\eta)/p}) \quad\text{a.s.}$$

If $A\delta \ge 1$ we get an (even better) error term of order $o(k^{1/p})$.

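The main part of the proof of Theorem 1 is based on a blocking argument.
We partition $\mathbb{N}$ into disjoint blocks

$$\mathbb{N} = J_1 \cup I_1 \cup J_2 \cup I_2 \cup \cdots,$$

where $|I_k| = \lfloor k^{\alpha}\rfloor$ and $|J_k| = \lfloor k^{\beta}\rfloor$ with $\alpha$, $\beta$ as in (42). Let us further set

$$I_k = \{\underline{i}_k,\ldots,\bar{i}_k\} \quad\text{and}\quad J_k = \{\underline{j}_k,\ldots,\bar{j}_k\}$$

and

$$\xi_k = \sum_{j\in I_k}Y_j', \qquad \eta_k = \sum_{j\in J_k}Y_j'.$$

Note that $\bar{i}_k = O(k^{1+\alpha})$. Provided that $\varepsilon_1$ in the definition of $m_k$ is chosen
small enough, this will imply that

$$|J_k| = \lfloor k^{\beta}\rfloor \ge \lfloor\varepsilon_1\bar{i}_k^{\,\delta}\rfloor = m_{\bar{i}_k},$$

and hence by assumption (B) it follows that $\{\xi_k\}$ and $\{\eta_k\}$ each define
a sequence of independent random variables.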
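
The blocking scheme can be made concrete with a few lines of code. The sketch below builds the first blocks of $\mathbb{N} = J_1 \cup I_1 \cup J_2 \cup I_2 \cup \cdots$ with $|I_k| = \lfloor k^{\alpha}\rfloor$ and $|J_k| = \lfloor k^{\beta}\rfloor$; the function name `build_blocks` and the values $\alpha = 1.5$, $\beta = 0.9$ in the demo are illustrative assumptions, not values taken from the paper.

```python
import math


def build_blocks(alpha, beta, num_blocks):
    """Return (I, J): lists of (first, last) index pairs for the long blocks I_k
    and the short separating blocks J_k, laid out as J_1, I_1, J_2, I_2, ...
    starting at index 1. Hypothetical helper for illustration only."""
    long_blocks, short_blocks = [], []
    pos = 1  # next unused positive integer
    for k in range(1, num_blocks + 1):
        len_j = math.floor(k ** beta)   # |J_k| = floor(k^beta) >= 1
        short_blocks.append((pos, pos + len_j - 1))
        pos += len_j
        len_i = math.floor(k ** alpha)  # |I_k| = floor(k^alpha) >= 1
        long_blocks.append((pos, pos + len_i - 1))
        pos += len_i
    return long_blocks, short_blocks


if __name__ == "__main__":
    I, J = build_blocks(alpha=1.5, beta=0.9, num_blocks=3)
    print("I:", I)
    print("J:", J)
```

Because each $J_k$ sits between $I_{k-1}$ and $I_k$, its length is the gap that separates consecutive long blocks, which is what the independence argument via assumption (B) relies on.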

The following lemma by Sakhanenko [43] (cf. also Shao [44]) is our crucial
ingredient for the construction of the approximating processes.

Lemma 2. Let $\{\xi_k\}$ be a sequence of centered independent random variables
with finite $p$th moments, $p > 2$. Then we can redefine $\{\xi_k\}$ on a suitable
probability space, together with a sequence $\{\xi_k^*\}$ of independent normal random
variables with $E\xi_k^* = 0$, $E(\xi_k^*)^2 = E\xi_k^2$, such that for any $x > 0$, $m \ge 1$

$$P\Bigl(\max_{1\le k\le m}\Bigl|\sum_{j=1}^{k}\xi_j - \sum_{j=1}^{k}\xi_j^*\Bigr| > x\Bigr) \le C\,\frac{1}{x^p}\sum_{k=1}^{m}E|\xi_k|^p,$$

where $C$ is an absolute constant.



We shall now apply Lemma 2 to the sequences $\{\xi_k\}$ and $\{\eta_k\}$. For this
purpose we need estimates of the moments $E|\xi_k|^p$ and $E|\eta_k|^p$. By Minkowski's
inequality and Proposition 4 we get

mkf < ( 

jelk P j&h 



ieh P j&h ^ 

0((|4l'/' + l4|-ir^')"). 
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Some easy algebra shows that the restrictions on the parameters A, 5, a 
and Eq imply 

A similar estimate holds for $E|\eta_k|^p$. Hence we can find constants $F_p$ such
that

$$E|\xi_k|^p \le F_p|I_k|^{p/2}$$

and

$$E|\eta_k|^p \le F_p|J_k|^{p/2},$$

where $F_p$ does not depend on $k$.

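Let $L_n = \sum_{k=1}^{n}|I_k|$. Then $L_n = O(n^{1+\alpha})$. By our previous estimates and
by Lemma 2 we infer that, after enlarging the probability space, we have

$$\begin{aligned}
P\Bigl(\max_{2^n\le k\le 2^{n+1}}\Bigl|\sum_{j=1}^{k}\xi_j - \sum_{j=1}^{k}\xi_j^*\Bigr| > L_{2^n}^{(1+\eta)/p}\Bigr)
&\le C\,L_{2^n}^{-(1+\eta)}\sum_{k=1}^{2^{n+1}}E|\xi_k|^p \\
&= O\bigl(2^{[-(1+\alpha)(1+\eta)+\alpha p/2+1]n}\bigr),
\end{aligned} \tag{43}$$

where $\{\xi_k^*\}$ is a sequence of independent and centered normal random variables
with $E(\xi_k^*)^2 = E\xi_k^2$. The exponent in (43) will be negative if $(1+\alpha)(1+\eta) > \alpha p/2 + 1$.
This is equivalent to $\alpha < \frac{2\eta}{p-2(1+\eta)}$, which follows by (42). Thus, by
the Borel-Cantelli lemma we obtain

$$\sum_{j=1}^{k}\xi_j = \sum_{j=1}^{k}\xi_j^* + o\bigl(L_k^{(1+\eta)/p}\bigr) \quad\text{a.s.}$$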

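By further enlarging the probability space we can write

$$\sum_{j=1}^{k}\xi_j = W_1\Bigl(\sum_{j=1}^{k}\mathrm{Var}(\xi_j)\Bigr) + o\bigl(L_k^{(1+\eta)/p}\bigr) \quad\text{a.s.},$$

where $\{W_1(t), t \ge 0\}$ is a standard Wiener process. The same arguments
show that

$$\sum_{j=1}^{k}\eta_j = W_2\Bigl(\sum_{j=1}^{k}\mathrm{Var}(\eta_j)\Bigr) + o\bigl(M_k^{(1+\eta)/p}\bigr) \quad\text{a.s.},$$

where $\{W_2(t), t \ge 0\}$ is another standard Wiener process on the same
probability space and $M_n = \sum_{k=1}^{n}|J_k|$.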






We define 



bl = y^v(j2Y^)/\i, 



and 



For i £ Ik we set (t| = 6| and for i & Jk we set a'j = 0. Similarly define 
r| = hlifieJk and t| = if ^ G 4. Put 



sl = Y.4, tl = Y,rl (n = l,2,...). 

A:=l fc=l 

Summarizing our results so far we can write 



fc=l \fc=l / \fc=l 



It is a basic result that our stationarity and dependence assumptions 
imply 



(44) 



Ya.i( I \Ik\ =(7"^ + 0{k-^) and 



as A; — >• CO, for some small enough ^ > 0. It can be easily shown that (44) 
remains true if the Yj are replaced with Y-. Indeed, by the Minkowski in- 
equality we infer that 

VarV2 (E < Var^/^ (E >s) + Var^^ (E " 



< Var J ^ y ^ ^ I I 1 1 y^. - y^' 1 1 2 . 

\eh ^ ^^^^ 



Furthermore, using the definitions of the introduced constants we obtain 
max 



max \\Y, - y/||2 « i-,^' « A:-("+i)^^ 



< k-^ = A:-(i-^o)« ^ith eo < 1/2. 
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Since by definition | | <^k", we conclude that 



:,Q(eo-l/2)^ 

as /e — 7- oo. In the same manner a lower bound for Var^/^(^^.gj^ 

can be obtained. Proving the analogue of the second part of (44) for the 

is similar. 

In other words, we have shown (11) along the subsequence {in} with values 
of and that satisfy (12) and (13). The relation — a'^n\ = 0{n^~^), 
e > 0, follows by simple calculations. 

To finish the proof we have to show that the fluctuations of the partial 
sums and the Wiener processes Wi and W2 within the blocks Ik are small 
enough. Since fluctuation properties of Wiener processes are easy to handle 
using standard deviation inequalities (see, e.g., [9]), we only investigate the 
partial sums. By Corollary 1 we have 

p\ 



P sup 



i=ifc 



>i(^+'')/^] <iI'''+'^^e( sup 



k 



<C yt-(l+'?)(l+°)+"P/2 

= 0(A:-(i+^2))^ 

if $\varepsilon_2 > 0$ is chosen sufficiently small. The Borel-Cantelli lemma shows that
we can also control the fluctuation within the blocks. Thus (11) is proven.

The proof of Theorem 2 is similar to the proof of Theorem 1 and will
therefore be omitted. We only remark that under the exponential mixing rate
logarithmic block sizes are required in the blocking argument.

6.3. Proof of Proposition 1. We use the notation introduced in the proof
of Theorem 1. Further we let $I = I_1 \cup I_2 \cup \cdots$ and $J = J_1 \cup J_2 \cup \cdots$ and
$M_n = \{1,\ldots,n\}$. By looking at the proof of Theorem 1, it readily follows
that

- yi = Wl{sl)/Sn-Xn, 



where 

(45) 



^ Y,=W2{tl)/tm-Zm, 

jeJnM„ 



Xn = o{{sl)^^+'iyP-^/^) = o{l) a.s. and 
^,n = o((4)(i+^')/^-V2) = o(l) a.s. 
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Hence 



Corr(l^i(4),T^2(4)) 

= Corr(^Wi{sl),^W2{tl 



jeJnMn 

In order to calculate this correlation we need a couple of estimates. 
First we note that by the definition of and 



(46) 



4~o-^|-fnM„| and ~ cr^| Jn M^l. 



It readily follows from Proposition 4 that 



(47) 

where Cp does not depend on n. Thus 

- 5] y,-T^i(4)/s, 



sup ll-'fnilp = sup 
n>l n>l 



< sup 

n>l 



+ ||Tyi(l)||p<oo, 



and hence {^^,} is uniformly integrable. This and (45) show that Var(X„ 
0; by the same arguments Var(Zm,) — 0. By (44) 



(48) 

Thus by (47) and (48) 



ci{m,n) ■.= Cov(Zm,— ^ Y-] 

<VarV2(z^)VarV2('l ^ 



iG/nM„ 

= o(l) form,n— T-oo, 

and similarly 

C2{m,n) := Cov( Xn, — Yjj=o{l) form, 



n — )• oo. 
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Furthermore we have 



and 



5i(n):=VarV2('— ^ Yi + X^) 

>Vari/2('l Yl >^.)-VarV2(;^^ 
= o" + 0(1) for n — )• 00 

S2(m):=Vari/2('J- ^ + Z^) 



jeJnM„ 
>(T + o(l) for m —7- 00. 

Finally we introduce the term 

co(m,n) = ^^ ^ ^ Cov(yi,yj). 

We choose r > such that n £ J^+i U J^+i , and we choose v >0 such that 
m G I^+i U Jy+i and recall that by Theorem 1 we have J2iez\^°^O^o^^)\ < 
00. Hence ii v <2r we have 

co(m,n)<rt-ri ^ ^ |Cov(y„y,)| 

jgJiU---UJ^+i jgZ 

jgJiU---uJ„+i iez 



<^7r'*7''(i'^ii + --- + i«^^'+ii) 



<^s7^t~^t? =0(1) as m,n—>- 00. 



If w > 2r, we have to additionally show that 



:V E E |Cov(y„y,)|^o. 



je/lU---U/r + l i6J'2r + lU---UJi,+ l 

Now we have by (32) and assumptions (8), (9) that 



X E E lcov(i^,i^; 

le/lU-'-U/r+l jG-'2r + lU---U J„ + l 
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r+l 

7r>2r+l £=1 
r+l 

7r>2r+l £=1 

For + and vr > 2r + 1 we have constants fco a.nd ki independent 

of r and vr such that 

"+1 ^ I,, 



d(/^,J.)>A;o(vr"+^-r"+^)>A;i7r 



and thus 



r+l 



7r>2r+l 1=1 

'^D ^^^^^ '^V '^'V 



n>2r+l 

7r>2r+l 

<r-("-'^)/2 ^o(l) asr^oo. 
Using the definitions of $c_0$, $c_1$, $c_2$ and $B_1$ and $B_2$ we see that

$$\mathrm{Corr}\bigl(W_1(s_n^2), W_2(t_m^2)\bigr) = \frac{c_0(m,n) + c_1(m,n) + c_2(m,n) + \mathrm{Cov}(X_n, Z_m)}{B_1(n)B_2(m)}. \tag{49}$$

We have shown $c_0(m,n) + c_1(m,n) + c_2(m,n) + \mathrm{Cov}(X_n, Z_m) \to 0$ as $m, n \to \infty$
while the denominator in (49) is bounded away from zero. This finishes
the proof of Proposition 1.